Refactor `ModelBuilder` into smaller classes #1870

ColtAllen · 2025-08-04T23:29:42Z

Description

@williambdean also deserves contributor credit for starting this PR.

ModelBuilder was originally developed to build models with an API very similar to the supervised learning models in scikit-learn. This works fine for MMMs, but creates considerable tech debt when different APIs are required. This PR is the first of several to clean up accumulated tech debt by dividing ModelBuilder into three separate classes:

UML diagram:

classDiagram

class ModelIO {
    - str _model_type
    + str version
    + az.InferenceData | None idata
    + dict sampler_config
    + dict model_config
    + id(self) str
    - _serializable_model_config(self) dict[str, int | float | dict]
    + create_idata_attrs(self) dict[str, str]
    + set_idata_attrs(self, idata) az.InferenceData
    + save(self, fname, **kwargs) None
    - @classmethod _model_config_formatting(cls, model_config) dict
    + @classmethod attrs_to_init_kwargs(cls, attrs) dict[str, Any]
    + @classmethod idata_to_init_kwargs(cls, idata) dict[str, Any]
    + build_from_idata(self, idata) None
    + @classmethod load(cls, fname, check)
    + @classmethod load_from_idata(cls, idata, check) "ModelIO"
}

class ModelBuilder {
    + str _model_type
    + str version
    - __init__(self, model_config, sampler_config) None
    + default_model_config(self) dict
    + default_sampler_config(self) dict
    + build_model(self, **kwargs) None
    + fit(self, **kwargs) None
    + graphviz(self, **kwargs)
    + table(self, **model_table_kwargs) Table
    + fit_result(self) xr.Dataset
    + fit_result(self, res) None
    + prior
    + prior_predictive
    + posterior
    + posterior_predictive
    + predictions
}

class RegressionModelBuilder {
    - _validate_data(self, X, y)
    - _data_setter(self, X, y) None
    + output_var(self) str
    + build_model(self, X, y, **kwargs) None
    + build_from_idata(self, idata) None
    + create_fit_data(self, X, y) xr.Dataset
    + post_sample_model_transformation(self) None
    + fit(self, X, y, progressbar, random_seed, **kwargs) az.InferenceData
    + predict(self, X, extend_idata, **kwargs) np.ndarray
    + sample_prior_predictive(self, X, y, samples, extend_idata, combined, **kwargs)
    + sample_posterior_predictive(self, X, extend_idata, combined, **sample_posterior_predictive_kwargs)
    + predict_proba(self, X, extend_idata, combined, **kwargs) xr.DataArray
    + predict_posterior(self, X, extend_idata, combined, **kwargs) xr.DataArray
}

ModelBuilder --|> `abc.ABC`
ModelBuilder --|> ModelIO
RegressionModelBuilder --|> ModelBuilder

The CLV models still inherit from a stripped-down ModelBuilder class, but all other models now inherit from RegressionModelBuilder. If the MNLogit and Nested Logit models are modified to inherit from ModelBuilder instead, the internals of both could be cleaned up quite a bit.
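For readers who want to see the shape of the split without opening the diff, here is a minimal sketch of the three-layer hierarchy described above. It is not the code from this PR: the method bodies are placeholders, `idata` is a plain dict instead of `az.InferenceData`, and `ToyCLVModel` and `ToyMMM` are hypothetical names used only for illustration.

```python
from abc import ABC, abstractmethod


class ModelIO:
    """Persistence concerns only (toy version, no real file I/O)."""

    _model_type = "BaseModel"
    version = "0.0.1"

    def create_idata_attrs(self):
        # The attributes a saved model would need in order to be rebuilt.
        return {"model_type": self._model_type, "version": self.version}


class ModelBuilder(ABC, ModelIO):
    """Model construction and fitting, with no assumption about X/y inputs."""

    def __init__(self, model_config=None, sampler_config=None):
        self.model_config = model_config or {}
        self.sampler_config = sampler_config or {}
        self.idata = None  # stand-in for az.InferenceData | None

    @abstractmethod
    def build_model(self, **kwargs):
        ...

    def fit(self, **kwargs):
        self.build_model(**kwargs)
        self.idata = {"fitted": True}  # placeholder for sampling results


class RegressionModelBuilder(ModelBuilder):
    """Adds the scikit-learn-like X/y API on top of ModelBuilder."""

    @abstractmethod
    def build_model(self, X, y, **kwargs):
        ...

    def fit(self, X, y, **kwargs):
        self.build_model(X, y, **kwargs)
        self.idata = {"fitted": True, "n_obs": len(y)}
        return self.idata


class ToyCLVModel(ModelBuilder):  # hypothetical CLV-style model
    _model_type = "ToyCLV"

    def build_model(self, **kwargs):
        self.model = "clv-graph"


class ToyMMM(RegressionModelBuilder):  # hypothetical regression-style model
    _model_type = "ToyMMM"

    def build_model(self, X, y, **kwargs):
        self.model = f"mmm-graph({len(X)} rows)"
```

In this sketch `ToyCLVModel().fit()` needs no `X` or `y`, while `ToyMMM().fit(X, y)` keeps the supervised-learning signature; both still pick up the persistence helpers from `ModelIO`.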

Related Issue

Closes #1380 (Consolidate CLV load with ModelBuilder classmethod)
Related to #1356 (Option to exclude fit data when saving models)

Checklist

Checked that the pre-commit linting/style checks pass. Feel free to comment pre-commit.ci autofix to auto-fix.
Included tests that prove the fix is effective or that the new feature works
Added necessary documentation (docstrings and/or example notebooks) using numpydoc format.
If you are a pro: each commit corresponds to a relevant logical change

📚 Documentation preview 📚: https://pymc-marketing--1870.org.readthedocs.build/en/1870/

ColtAllen · 2025-08-04T23:32:26Z

@williambdean do you still think the check parameter for load is necessary?

codecov · 2025-08-04T23:32:41Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 92.15%. Comparing base (32ed4db) to head (f7295af).
⚠️ Report is 3 commits behind head on main.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #1870      +/-   ##
==========================================
+ Coverage   91.89%   92.15%   +0.26%     
==========================================
  Files          64       64              
  Lines        7577     7587      +10     
==========================================
+ Hits         6963     6992      +29     
+ Misses        614      595      -19
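The percentages in the diff can be reproduced from the hit and line counts. The sketch below assumes the report truncates to two decimals rather than rounding; that is an inference from the numbers shown (rounding would give 91.90 and 92.16), not something the report states.

```python
import math


def coverage_pct(hits: int, lines: int) -> float:
    """Coverage percentage, truncated (not rounded) to two decimals."""
    return math.floor(hits / lines * 100 * 100) / 100


# Counts taken from the coverage diff above.
base = coverage_pct(6963, 7577)  # main
head = coverage_pct(6992, 7587)  # this PR
delta = round(head - base, 2)

print(base, head, delta)  # 91.89 92.15 0.26
```

The counts are also internally consistent: hits plus misses equals lines on both sides (6963 + 614 = 7577 and 6992 + 595 = 7587).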

williambdean

Looking through. Not many objections. What else is needed here?

pymc_marketing/model_builder.py

ColtAllen · 2025-08-11T21:35:04Z

> Looking through. Not many objections. What else is needed here?

Just your thoughts on including the check parameter for the load function. The only justification I see for including it is if backwards-compatibility with older library versions is desirable.
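To make the trade-off concrete, here is a rough sketch of what a `check` flag on a load function can guard against. It is an illustration only, not the library's implementation: `ToyModel`, `create_attrs`, and `load_from_attrs` are hypothetical, and the saved attributes are a plain dict rather than an `InferenceData` object. The assumption is that the model id is a hash that includes the version, so a file written by an older release fails the check unless it is disabled.

```python
import hashlib
import json


class DifferentModelError(Exception):
    """Raised when saved attributes do not match the rebuilt model."""


class ToyModel:
    # Hypothetical stand-in used only for this sketch.
    _model_type = "ToyModel"
    version = "0.0.2"

    def __init__(self, model_config):
        self.model_config = model_config

    @property
    def id(self) -> str:
        # Hash everything that defines the model, including the version.
        payload = json.dumps(
            [self.model_config, self.version, self._model_type], sort_keys=True
        )
        return hashlib.sha256(payload.encode()).hexdigest()[:16]

    def create_attrs(self) -> dict:
        return {"id": self.id, "model_config": json.dumps(self.model_config)}

    @classmethod
    def load_from_attrs(cls, attrs: dict, check: bool = True) -> "ToyModel":
        model = cls(json.loads(attrs["model_config"]))
        if check and model.id != attrs["id"]:
            raise DifferentModelError("saved id does not match rebuilt model")
        return model


saved = ToyModel({"sigma": 1.0}).create_attrs()
saved["id"] = "id-from-an-older-version"  # simulate a file saved by older code

# With check=False the mismatch is ignored and the model still loads.
legacy = ToyModel.load_from_attrs(saved, check=False)
```

With the default `check=True`, the same call raises `DifferentModelError`, which is the stricter behaviour the flag would let users opt out of.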

Testing coverage is 99% locally; not sure why the CodeCov bot is always lower than the pytest plugin.

pymc_marketing/clv/models/basic.py

juanitorduz · 2025-08-18T18:49:39Z

From my side looks great! @williambdean, anything we are missing?

juanitorduz · 2025-08-20T07:47:53Z

As there are no further comments, I suggest we merge this :)

Amazing work @ColtAllen !

PabloRoque · 2025-08-20T08:36:13Z

It was a biggie, hadn't found the time, sorry 🙈

juanitorduz · 2025-08-20T10:26:44Z

> It was a biggie, hadn't found the time, sorry 🙈

All good! Thank you @PabloRoque 💪

williambdean and others added 28 commits February 4, 2025 11:47

break out into smaller classes

d6e15a8

change the default to True

e9f6f5c

Merge branch 'main' into break-down-modelbuilder

ad351e2

Merge branch 'main' into break-down-modelbuilder

b8c1791

Merge branch 'main' into break-down-modelbuilder

63cad51

WIP unit testing

94347b8

requires_model methods

3085840

test_graphviz

9b03b07

TODOs and fix MOdelBuilder tests

83dfdc8

remove test model file

3dd6fce

fix customer and MMM tests

f733057

WIP fix clv tests

536a252

Merge branch 'main' into break-down-modelbuilder

6416c01

Merge branch 'pymc-labs:break-down-modelbuilder' into break-down-modelbuilder

21d87a6

fix clv base tests

5ac567b

fix remaining clv tests

59fa1bd

docstrings

c7cfa41

docstring edit

b4c98ee

remove idata_to_init_kwargs

f9a62ad

Merge branch 'main' into break-down-modelbuilder

6e1816e

CLV models inherit from BaseModelBuilder

ab0ae5f

Merge branch 'pymc-labs:break-down-modelbuilder' into break-down-modelbuilder

d3a325e

WIP test cleanup

810ac86

fix base model tests

1a9701a

clean up tests

58ab9e3

testing coverage

77aa802

add abstract methods to ModelBuilder

a06c6ba

docstrings

b523bc3

ColtAllen added this to the 0.16.0 milestone Aug 4, 2025

ColtAllen requested a review from williambdean August 4, 2025 23:29

ColtAllen added the "model components" label Aug 4, 2025

github-actions bot added the "CLV", "MMM", and "customer choice" labels Aug 4, 2025

ColtAllen added the priority: high label Aug 5, 2025

This was referenced Aug 5, 2025

Consolidate model fitters into ModelBuider #1871

Open

Break ModelBuilder into smaller classes #1467

Closed

Merge branch 'main' into break-down-modelbuilder

0d5a27c

williambdean reviewed Aug 11, 2025

pymc_marketing/model_builder.py (outdated, resolved)

ColtAllen and others added 2 commits August 11, 2025 15:12

Merge branch 'main' into break-down-modelbuilder

760a848

docstrings return model_config

9fb252e

williambdean requested a review from juanitorduz August 12, 2025 02:39

PabloRoque reviewed Aug 12, 2025

pymc_marketing/clv/models/basic.py (resolved)

ColtAllen and others added 2 commits August 12, 2025 07:26

Merge branch 'main' into break-down-modelbuilder

040100c

remove duplicated self.data attr

163d772

juanitorduz requested review from PabloRoque and williambdean August 13, 2025 07:44

Merge branch 'main' into break-down-modelbuilder

eb26441

juanitorduz approved these changes Aug 18, 2025

Merge branch 'main' into break-down-modelbuilder

f7295af

juanitorduz enabled auto-merge (squash) August 20, 2025 07:47

juanitorduz merged commit ad84e90 into pymc-labs:main Aug 20, 2025
29 of 31 checks passed

ColtAllen deleted the break-down-modelbuilder branch August 20, 2025 23:16

4 participants
